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(5)  INTRODUCTION 

Alternative  splicing  has  the  capacity  to  generate  a  number  of  RNA  iso  forms  from 
a  single  gene.  The  cell  adhesion  molecule  CD44  can  theoretically  be  alternatively  spliced 
to  form  more  than  one  thousand  different  mRNAs  (8,  10,  21).  The  CD44  alternative 
exons,  all  in  the  same  translational  reading  frame,  reside  within  a  single  block  in  the 
middle  of  the  gene  (Fig.  1A).  Interest  in  CD44  splicing  peaked  when  it  was  observed  that 
inclusion  of  one  alternative  exon  was  causative  for  metastasis  in  a  rat  model  of  colon 
cancer  (9).  CD44,  is  a  widely  expressed  cell  adhesion  molecule  and  trans-membrane 
glycoprotein  that  mediates  a  variety  of  cell-cell  and  cell  matrix  interactions  (6, 16,  22).  A 
number  of  CD44  isoforms  arising  from  alternative  splicing  are  expressed  in  normal 
tissues  (8,  10,  21).  Recently  it  has  been  shown  that  alternative  splicing  of  CD44 
responds  to  signal  transduction.  Treatment  of  T-lymphoma  cells  that  normally  express  a 
form  of  CD44  lacking  variable  exons  with  phorbol  esters,  concanavalin  A,  or  c-ras 
induces  inclusion  of  variable  exons  (13). 

We  and  others  have  recently  observed  that  CD44  alternative  splicing  increases  in 
cancer  cells  (2,  5,  23).  There  are  simultaneous  alterations  in  expression  of  certain 
alternative  splicing  factors,  including  the  well-studied  serine-arginine-rich  (SR)  proteins 
(7,  23).  SR  proteins  are  known  to  bind  to  splicing  enhancer  sequences  located  within 
exons  and  thereby  stimulate  the  binding  of  constitutive  splicing  factors  such  as  U2 
snRNP-associated  factor  (U2AF)  and  U1  snRNPs  (1,  17,  20).  The  transformer  2  protein 
(Tra2)  is  one  such  SR  protein  (11,  14,  25).  Discovered  in  Drosophila  melanogaster,  Tra2 
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regulates  alternative  splicing  in  the  doublesex  gene  by  binding  to  CA-rich  enhancer 
sequences  (15)  within  the  alternative  exon  in  conjunction  with  another  SR  protein,  tra. 

Recently  the  human  orthologs  of  Tra2  have  been  cloned  and  characterized  (3,  24). 
Two  very  similar  genes  Tra2a  and  Tra2(l  exist  in  humans.  Both  have  the  capacity  to 
bind  to  purine-rich  exon  enhancer  sequences  although  a  natural  target  gene  for  either 
human  isoform  of  Tra2  is  unknown.  Examination  of  the  sequences  of  CD44  variable 
exons  revealed  a  number  of  exon  sequences  that  resembled  the  CA-rich  exon  enhancer 
from  the  doublesex  gene  (Fig.  1B-D),  suggesting  a  potential  role  for  human  Tra2  in  the 
splicing  of  CD44.  Here  we  used  a  mini-gene  containing  CD44  variable  exons  v4  and  v5 
to  study  the  ability  of  SR  proteins,  including  Tra2,  to  influence  CD44  splicing.  We  show 
that  Tra2a  enhances  exon  inclusion  of  both  variable  exons  4  and  5  in  HeLa  cells.  In 
contrast,  other  SR  proteins,  including  SRp55,  ASF/SF2,  and  SC35  had  no  effect  on  the 
splicing  of  these  two  exons.  One  other  SR  protein,  SRp75,  also  had  an  effect  on  splicing 
of  these  exons  but  produced  a  different  phenotype  in  which  only  one  of  the  two  exons 
was  included  in  the  majority  of  mRNAs.  And  finally,  SRp20,  also  stimulated  splicing  of 
v4  and  v5,  although  to  a  lesser  extent  that  Tra2. 

In  previous  studies  of  the  alteration  of  SR  protein  expression  in  tumors,  the 
expression  of  Tra2  had  not  been  examined.  Using  a  mouse  model  of  mammary 
tumorigenesis,  we  followed  the  expression  of  Tra2  in  both  normal  development  and 
neoplasia.  Tra2  was  induced  during  pregnancy  and  in  preneoplasias.  Levels  were  even 
higher  in  neoplasias  and  their  metastases.  CD44  alternative  splicing,  especially  of 
variable  exons  4  and  5,  paralleled  the  expression  pattern  of  Tra2,  suggesting  that  changes 
in  SR  protein  levels  during  tumorigenesis  may  contribute  to  CD44  alternative  splicing. 
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(6)  BODY 

Experimental  Methods,  Assumptions,  and  Procedures 

Plasmids  and  transfections.  The  reporter  CD44  mini-gene  used  throughout  this  study 
(diagrammed  in  Figure  2B)  was  created  by  inserting  sequences  from  the  human  CD44 
gene  into  the  first  exon  of  an  engineered  P-globin  in  vivo  expression  plasmid,  Dup  33  (4) 
obtained  from  R.  Kole,  University  of  North  Carolina,  Chapel  Hill,  NC.  The  CD44 
sequence  spanned  a  sequence  from  792  nucleotides  before  variable  exon  4  to  515 
nucleotides  after  variable  exon  5  and  included  all  of  the  natural  exon  between  exons  4 
and  5.  The  SR  protein  expression  plasmids  were  obtained  from  W.  Mattox,  M.D. 
Anderson  Cancer  Center,  Houston  TX  (D.  Melanogaster  tra,  human  Tra2a  and  human 
Tra2P);  J.  Stevenin,  Institut  de  Genetique  et  de  Biologie  Moleculaire  et  Cellulaire  (9G8); 
J.  Manley,  Columbia  Univ.,  New  York,  N.Y  (ASF/SF2);  and  M.  Roth,  Fred  Hutchinson 
Cancer  Research  Center,  Seattle,  WA  (SRp75,  SRp55,  SRp20,  SC35). 

All  transfections  used  HeLa  cells  and  were  done  in  the  presence  of 
lipofectAMINE™  (Gibco/BRL)  according  to  the  manufacturer’s  instructions.  Total  cell 
RNA  was  isolated  48  hours  post  transfection  using  TRIzol™  (Gibco/BRL)  following 
manufacturer’s  instructions.  Splicing  patterns  were  determined  by  RT/PCR  analysis 
using  5’  end-radiolabeled  primers  specific  for  p-globin  sequences  (5’ 
AGACACCATGCATGGTGCACC  and  3’  CCTGATCAGCGAGCTCTAG).  These 
primers  amplified  no  RNA  from  untransfected  HeLa  cells.  Amplification  conditions  were 
1  min.  at  94°C,  1  min.  at  58°C  and  1.5  min  at  72°C  for  25  cycles.  Product  DNA  was 
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denatured  and  displayed  on  a  6%  urea  gel.  RNA  products  were  quantified  in  the 
Phosphoimager.  Plasmid  pBR322  digested  with  Hpa  II  markers  is  used  in  all  displayed 
gels  for  product  size  determination.  Identified  amplification  products  resulting  from  the 
inclusion  of  one  or  two  variable  CD44  exons  were  sequenced  to  verify  identity. 

Tissues.  We  used  an  in  vivo  mouse  model  of  mammary  development  and  tumorigenesis 
to  study  normal  tissue,  pregnant  and  lactating  glands,  preneoplastic  lesions,  primary 
adenocarcinomas  and  organ  metastases  to  the  liver  and  lung  (12,  18).  The  TM 
preneoplastic  outgrowth  lines  developed  after  transplantation  of  established  mouse 
mammary  epithelial  cell  lines  (MMEL)  into  cleared  mammary  fat  pads  of  3  week  old 
syngenic  BALB/cMed  mice.  The  serially-transplanted  outgrowths  were  removed  either 
as  preneoplasias  at  8-12  weeks  after  transplantation  or  as  tumors  5-7  months  after 
transplantation.  After  removal,  the  preneoplastic  outgrowths,  primary  adenocarcinomas, 
and  metastases  were  frozen  at  -80  °C  for  further  analysis. 

Epithelial  cell  isolation  from  mature  virgin  mammary  tissue.  Isolated  mammary  fat 
pads  were  minced  and  incubated  in  DMEM  medium  with  2  mg/ml  collagenase  A 
(Boehringer  Mannheim,  Germany)  and  100  U/ml  hyaluronidase  (Boehringer  Mannheim, 
Germany)  at  37  °C  for  3  hours  and  slowly  swirled.  Afterwards  the  solution  was 
centrifuged  for  5  min  at  1000  rpm.  The  supernatant,  containing  fat  and  single  cells,  was 
discarded  and  the  pellet,  containing  mammary  epithelial  cells,  was  washed  in  PBS/5% 
FCS  and  stored  at  -80  °C  for  further  analysis. 
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Analysis  of  tissue  RNA.  Tissues  were  minced  on  dry  ice  in  before  they  were  treated 
with  a  tissue  homogenizer  (Polytron,  Littau,  Switzerland)  in  TRIzol  solution  (Gibco- 
BRL,  Gaithersburg,  MD)  with  1ml  per  100  mg  of  tissue  for  3  times  10  sec.  After  5 
minutes  of  incubation  at  26°C,  0.2  ml  of  chloroform  per  1  ml  of  TRIzol  were  added  and 
the  solution  was  shaken  vigorously,  followed  by  incubation  at  26°C  for  5  minutes.  After 
centrifugation  for  15  min  at  4°C  and  12.000  x  g  the  aqueous  phase,  containing  the  total 
RNA,  was  precipitated  with  0.5  ml  of  isopropyl  alcohol  per  1  ml  of  TRIzol  for  10 
minutes  at  26°C  and  centrifuged  for  10  min  at  12.000  x  g  at  4°C.  The  pellets  were  rinsed 
with  75%  ethanol  (1ml  of  ethanol  per  1  ml  of  TRIzol  reagent  used  for  the  initial 
homogenization),  vortexed  and  centrifuged  at  4°C  for  5  min  at  7.500  x  g  and  finally 
dissolved  in  RNAse  free  water  and  stored  at  -80°C  for  further  analysis.  Five  micrograms 
of  RNA  were  used  for  cDNA  synthesis  using  M-MuLV  reverse  transcriptase  (Perkin 
Elmer,  Branchburg,  NJ)  and  oligo-dT  primers  followed  by  PCR  using  specific  primers: 
hnRNP  Al:  5’  GGTGGTCGTGGAGGTGGTT,  3’  CCAAAATCATTGTAGCTTCC;  Tra2a:  5’ 
TATGATTACCGGTACAGAAGAAGGT,  3'  GGAAACAAGCAACACAACTGG;  and  Tra20 
5'  TGACAATACATTTCCACCACAG,  3’  GCATCAGCAATTTTTCTTCC 

Results  and  Discussion 

A  mini-gene  containing  CD44  variable  exons  4  and  5  expresses  mRNA  including 
exons  4  and  5  in  the  presence  of  some  SR  proteins.  During  a  previous  study  of  CD44 
alternative  splicing  we  discovered  that  inclusion  of  variable  exon  5  was  a  good  marker 
for  murine  mammary  tumorigenesis  in  that  this  exon  was  never  included  in  normal  RNA 
but  was  included  in  preneoplasia  and  neoplasias  (23).  When  exon  5  was  included,  exon  4 
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was  also  included.  We  therefore,  decided  to  begin  an  analysis  of  trans-acting  factors  that 
regulate  CD44  variable  splicing  by  investigating  if  known  splicing  factors  could 
influence  the  level  of  inclusion  of  exons  4  and  5.  We  created  a  mini-gene  in  which  exons 
4  and  5  and  the  natural  intron  sequences  surrounding  them  were  inserted  into  intron  1  of  a 
P-globin  mini-gene  (Figure  2B).  Transfected  into  HeLa  cells,  only  a  small  percentage  of 
the  resulting  mRNA  included  either  exon  4  or  5  (Fig.  2A  and  C). 

A  number  of  cDNA  expression  plasmids  coding  for  known  SR  proteins  were  co¬ 
transfected  with  the  reporter  mini-gene.  Plasmids  expressing  SC35  (Fig.  2 A  and  C) 
SRp55  (Fig.  5)  or  ASF/SF2  (Fig.  5)  had  no  effect  on  the  levels  of  exon  inclusion.  The 
lack  of  effect  upon  co-transfection  with  SRp55  is  somewhat  surprising  given  the  ability 
of  the  D.  melanogaster  homologue  of  SRp55,  dSRp55,  to  interact  and  be  required  for 
doublesex  splicing  (19).  In  contrast,  co-expression  of  Tra2a  increased  inclusion  of  both 
exons  4  and  5  (Fig.  2A  and  C).  Total  inclusion  rose  from  27%  to  62%  +/-  5%  in  the 
presence  of  Tra2a,  suggesting  that  Tra2a  could  be  an  alternative  splicing  factor  acting  to 
increase  CD44  exon  inclusion. 

The  CD44  variable  exons  are  weak  exons.  Therefore,  it  was  possible  that  the 
effect  of  Tra2a  on  exon  inclusion  was  non-specific.  To  see  if  Tra2a  could  increase  the 
inclusion  of  any  weak  exon,  we  tested  the  ability  of  co-transfected  Tra2a  to  increase  the 
expression  of  a  purposefully  weakened  beta  globin  exon  in  the  same  P-globin  expression 
vector  used  in  Figure  2  (Fig.  3).  This  exon  has  been  used  before  to  study  the  ability  of 
accessory  enhancer  sequences  and  trans-acting  factors  to  regulate  splicing  (4). 
Transfection  of  this  mini-gene  results  in  production  of  mRNA  in  which  less  than  20%  of 
the  RNA  contains  the  central  exon.  Co-transfection  with  Tra2a  had  no  effect  on 
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inclusion  of  this  exon,  suggesting  that  Tra2a  cannot  bind  and  affect  all  weak  exons  in 
this  mini-gene  context  (Fig.  3). 

Examination  of  the  sequences  of  exons  4  and  5  revealed  the  presence  of  multiple 
CA-rich  sequences  within  the  exons,  especially  exon  4  (Fig.  IB).  These  sequences 
closely  resembled  the  CA-rich  sequences  in  the  Drosophila  doublesex  gene  (Fig.  ID), 
suggesting  direct  binding  of  Tra2ct  to  the  CD44  exons.  To  test  this  hypothesis  we  asked 
if  purified  recombinant  Tra2a  could  bind  to  exon  4  sequences.  Using  splicing  extracts 
from  HeLa  cells,  we  were  able  to  show  that  purified  recombinant  Tra2a  bound  to  exon  4 
sequences  (data  not  shown).  Both  homologous  sequences  and  the  Drosophila  doublesex 
enhancer  were  effective  competitors  for  this  binding,  but  non-specific  sequences  were 
not.  Thus,  the  ability  of  Tra2a  to  enhance  CD44  exon  inclusion  in  HeLa  cells  is 
predictable  from  its  RNA  binding  preference. 

Relationship  of  Tra2a  regulation  of  CD44  splicing  to  regulation  of  doublesex 
splicing.  As  a  positive  control  for  the  ability  of  Tra2a  to  affect  exon  inclusion  of  an 
appropriate  exon  we  tested  the  ability  of  co-transfected  Tra2a  to  increase  recognition  of 
the  3’  splice  site  from  the  D.  melanogaster  female-specific  exon  in  a  transfected  HeLa 
cell.  It  is  known  that  human  Tra2a  can  replace  D.  melanogaster  Tra2  in  vivo  (3).  Our 
experiment  was  done  using  a  mammalian  expression  vector  in  which  the  second  exon 
began  with  the  3’  splice  site  of  the  D.  melanogaster  doublesex  female-specific  exon. 
Transfected  into  HeLa  cells,  this  reporter  is  dependent  upon  co-transfection  with  Trafor 
any  recognition  of  the  female  3’  splice  site.  In  the  presence  of  tra,  exon  recognition 
increased;  even  greater  inclusion  was  observed  in  the  presence  of  both  Traand  Tra2a 
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(data  not  shown).  Thus,  human  Tra2a  has  properties  very  similar  to  Drosophila  Tra2  and 
would  be  expected  bind  to  CA-rich  sequences. 

The  observation  that  human  Tra2a  affects  CD44  alternative  splicing  suggests  that 
Tra  might  also  be  involved.  No  vertebrate  homologue  of  Tra,  however,  has  been 
described.  It  is  unclear  if  such  a  protein  even  exists.  To  test  the  possibility,  we  asked  if 
CD44  alternative  splicing  in  our  system  would  respond  to  addition  of  D.  melanogaster 
Tra  into  HeLa  cells.  In  this  experiment  our  CD44  reporter  plasmid  was  co-transfected 
with  expression  vectors  encoding  D.  melanogaster  Tra,  human  Tra2a,  or  a  mixture  of  the 
two  (Fig.  4).  Co-transfection  with  Tra  had  a  very  modest  effect  on  CD44  alternative 
splicing.  More  importantly  the  ability  of  Tra2a  to  stimulate  CD44  alternative  splicing 
was  unaffected  by  the  simultaneous  presence  of  Tra.  This  observation  suggests  that  the 
involvement  of  Tra2a  in  CD44  splicing  is  different  than  its  involvement  in  doublesex 
splicing. 

One  other  SR  protein,  9G8,  has  been  implicated  in  Drosophila  doublesex  splicing 
(14).  This  protein  has  an  RNA  binding  domain  quite  similar  to  that  in  both  human 
SRp20  and  D.  melanogaster  rbpl.  Interestingly,  however,  transfection  of  9G8  had  no 
effect  on  alternative  splicing  in  our  CD44  mini-gene,  suggesting  another  important 
differences  between  the  alternative  splicing  of  CD44  and  the  D.  melanogaster  doublesex 
pre-mRNA  (Fig.  2A). 

Other  SR  protein  also  influence  splicing  of  CD44  alternative  exons  4  and  5.  Two 

other  SR  proteins  also  affected  the  levels  of  exon  inclusion  in  our  test  mini-gene  -  SRp20 
and  SRp75.  Like  Tra2a,  SRp20  co-transfecftion  increased  the  inclusion  of  both  exons 
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(Fig.  5).  The  level  of  inclusion,  however,  was  less  than  for  Tra2a.  The  Drosophila  SR 
protein  most-similar  to  SRp20,  rbpl,  has  been  implicated  in  the  recognition  of  the 
Drosophila  doublesex  exon,  either  through  recognition  of  the  exon  enhancer  or  through 
recognition  of  the  3’  splice  site.  Thus,  it  is  probably  not  surprising  that  SRp20  also 
activated  exon  inclusion  in  CD44  exons  containing  CA-rich  sequences. 

The  other  SR  protein  that  affected  splicing  in  our  system  was  SRp75  (Fig.  5). 
Unlike  the  other  effectors,  however,  SRp75  had  less  ability  to  increase  the  inclusion  of 
both  exons.  Instead,  mRNAs  were  observed  that  contained  one  of  the  variable  exons,  not 
both.  Because  little  is  known  about  the  RNA  binding  preferences  of  SRp75,  it  is  difficult 
to  interpret  the  effect  of  SRp75  in  this  system. 

Expression  of  Tra2a  increases  during  mammary  tumorigenesis.  Alternative  splicing 
of  CD44  increases  during  the  development  of  murine  mammary  cancer  (2,  5,  23).  Thus, 
any  factor  postulated  to  be  involved  in  the  alternative  processing  of  CD44  would  be 
expected  to  have  increased  or  altered  activity  in  tumor  cells.  To  monitor  Tra2  expression 
we  utilized  a  murine  system  of  mammary  cancer  that  we  used  previously  to  document 
increases  inCD44  alternative  splicing  during  mammary  tumorigenesis  (12,  18).  RT/PCR 
analysis  was  used  to  determine  the  splicing  patterns  of  CD44  in  both  normal  and 
neoplastic  cells.  Both  Tra2a  and  Tra2p  were  monitored  and  compared  to  an  internal 
standard  of  an  hnRNP  protein  whose  expression  is  constitutive  in  most  cells.  Neither 
Tra2a  or  Tra2p  protein  was  detected  in  normal  tissue  from  virgin  female  mice  (data  not 
shown).  Modest  levels  of  both  proteins  were  observed  in  lactating  or  pregnant  breast, 
suggesting  induction  of  both  forms  of  Tra2  during  pregnancy  (Fig.  6).  Similar  modest 


13 


Susan  M.  Berget  Metastatic  Regulation  of  Differential  Splicing  of  CD44 


levels  were  seen  in  some  but  not  all  preneoplasias  tested.  A  consistent  high  level  of  both 
RNAs  was  observed  in  cell  lines  derived  from  preneoplasias  and  all  tested  tumors  and 
their  metastases,  indicating  considerable  induction  of  both  factors  during  tumorigenesis. 
Thus,  Tra2  expression  patterns  are  consistent  with  a  requirement  for  Tra2  during  the 
increased  splicing  of  CD44  observed  during  mouse  mammary  tumorigenesis. 

Discussion.  Our  results  indicate  that  SR  proteins  may  be  involved  in  enhancement  of 
alternative  splicing  of  CD44,  especially  of  the  variable  exons  v4  and  v5  whose  inclusion 
increases  during  many  cancers,  including  breast  cancer.  Both  exons  4  and  5  contain 
multiple  copies  of  a  CA-rich  sequence  very  similar  to  the  CA-rich  sequences  observed  in 
the  D.  melanogaster  doublesex  gene,  suggesting  that  some  or  all  of  the  factors  implicated 
in  alternative  splicing  of  doublesex  might  operate  in  the  alternative  splicing  of  CD44. 
Here  we  show  that  some,  but  not  all,  of  the  proteins  implicated  in  doublesex  splicing  are 
factors  that  can  regulate  CD44  alternative  splicing  in  vivo.  Figure  7  contrasts  the  factors 
known  to  bind  to  the  D.  melanogaster  doublesex  enhancer  as  contrasted  to  the  factors 
described  in  this  report  for  CD44  exons  4  and  5. 

Similar  to  the  regulation  of  doublesex,  Tra2  and  the  rbpl-like  human  protein 
SRp20  activated  exon  inclusion.  Unlike  doublesex  Tra,  9G8,  and  SRp55  had  no  effect  on 
CD44  splicing.  In  addition  one  protein  not  implicated  in  any  known  enhancer-mediated 
splicing  event,  SRp75,  activated  exon  inclusion  of  CD44.  Our  results  underscore  the 
modular  nature  of  splicing  enhancers.  In  addition  they  suggest  how  combinatorial 
binding  of  multiple  splicing  factors  can  result  in  gene-specific  splicing  patterns. 
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Recommendations  in  relation  to  the  statement  of  work.  During  the  previous  funding 
period  our  work  was  intended  to  concentrate  on  identifying  trans-acting  factors  that 
regulate  alternative  splicing  of  CD44.  As  this  report  shows  we  are  actively 
demonstrating  a  number  of  factors  that  regulate  this  alternative  splicing  event.  In 
addition  to  the  work  described  above,  we  have  in  vitro  splicing  results  that  indicate  that 
there  is  at  least  one  additional  factor  that  we  need  to  characterize.  Addition  of  purified 
recombinant  Tra2a  to  in  vitro  extracts  results  in  increased  binding  of  a  non-SR  protein  of 
50  kDA  to  the  CD44  exon  4  CA-rich  sequences.  We  are  presently  characterizing  this 
factor.  We  are  also  currently  constructing  a  battery  of  exon  mutations  to  sort  out  which 
of  the  factors  mentioned  in  this  report  bind  to  which  of  the  possible  exon  enhancer 
sequences  within  exons  4  and  5. 

(  7)  KEY  RESEARCH  ACCOMPLISHMENTS 

V  Establishment  of  an  in  vivo  transfection  system  that  carries  out  alternative  processing 
of  CD44  variable  exons  and  responds  to  exogenous  factors. 

V  Determination  that  human  Tra2a  influences  CD44  alternative  splicing 

V  Determination  that  SRp20  and  SRp75  also  influence  CD44  alternative  splicing 

■/  Determination  that  purified  recombinant  Tra2a  binds  to  CD44  exon  sequences  in 
vitro. 

■/  Determination  that  recombinant  Tra2a  activates  the  binding  of  one  other  non-SR 
protein  to  CD44  exon  sequences 

V  Determination  that  Tra2a  is  induced  during  mouse  mammary  tumorigenesis. 
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(8)  REPORTABLE  OUTCOMES 

Manuscripts: 

Stickeler,  E.,  F.  Kittrell,  D.  Medina,  and  S.  M.  Berget.  1999.  Stage-specific  changes  in 
SR  splicing  factors  and  alternative  splicing  in  mammary  tumorigenesis.  Oncogene 
18:3574-3582. 

Abstracts 

Stickeler,  E.,  D.  Medina,  T.  Cooper,  W.  Mattox,  and  S.  M.  Berget.  1999.  CD44 
alternative  splicing.  Cold  Spring  Harbor  Pre-mRNA  Processing  Meeting,  Cold 
Spring  Harbor  New  York,  August  1999. 

Employment 

Elmar  Stickeler  has  just  accepted  a  position  as  an  Assistant  Professor  of  Obstetrics  and 
Gynecology  at  the  University  of  Frieberg,  Frieberg,  Germany  and  is  applying  for 
his  first  independent  funding. 

(9)  CONCLUSIONS 

Our  results  indicate  that  alternative  splicing  of  CD44  is  a  complicated  process 
involving  multiple  factors  that  recognize  important  sequences  within  the  specific  exons 
and  boost  splicing.  Here  we  report  the  first  function  for  human  Tra2,  an  important 
alternative  splicing  factor  in  Drosophila  used  there  for  sex  determination.  It’s  human 
function  has  not  been  known.  In  addition  we  report  that  other  SR  proteins  may  interact 
with  Tra2  to  promote  exon  recognition  in  CD44.  Interestingly  we  observe  a  different 
constellation  of  proteins  in  CD44  splicing  than  those  reported  for  D.  melanogaster 
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doublesex  splicing.  This  is  the  first  example  of  comparison  of  a  set  of  proteins 
interacting  with  similar  sequences  in  two  different  species.  They  underscore  the  ability 
of  multiple  splicing  factors  to  bind  to  a  given  regulatory  sequence  and  the  need  to 
understand  the  identity  of  all  of  the  binding  proteins  to  the  assessment  of  function. 
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(11)  APPENDICES 

Figure  Legends 

Figure  1.  Structure  and  Sequence  of  the  Human  CD44  Gene.  (A)  Diagram  of  the  human 
CD44  gene  with  the  multiple  internal  alternative  (variable)  exons  indicated  in  darker 
gray.  The  exons  studied  in  this  report  are  the  fourth  and  fifth  alternative  exons  (variable 
exons  4  &  5  or  v4  &  v5).  (B  &  C)  Sequences  of  CD44  variable  exons  4  and  5.  The  exon 
sequence  is  boxed.  Splice  sites  are  indicated.  The  repeated  CA-rich  sequences  in  each 
are  shown  in  bold  and  underlined.  A  GA-rich  repeat  known  to  be  an  enhancer  is  shown 
in  italics  in  variable  exon  5.  In  addition  a  pyrimidine-rich  sequence  that  is  a  good 
candidate  binding  site  for  SRp20  is  indicated.  (D)  Comparison  of  the  CA-rich  sequences 
in  CD44  variable  exons  4  &  5.  The  CA-rich  sequences  are  complied  and  compared  to 
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both  a  derived  consensus  and  the  D.  melanogaster  doublesex  repeat  element  known  to 
bind  Tra2. 

Figure  2.  Enhancement  of  inclusion  of  CD44  variable  exons  4  and  5  by  Tra2a.  (A) 
Quantification  of  RNA  resulting  from  the  inclusion  of  both  variable  exons  4  and  5 
following  transfection  of  HeLa  cells  with  the  mini-gene  depicted  in  (B)  in  the  presence  of 
a  co-transfected  expression  plasmid  coding  for  human  Tra2a,  SC35,  or  9G8. 
Transfections  were  performed  using  increasing  amounts  of  expression  plasmid  and  a 
constant  amount  of  the  reporter  plasmid  (expression  plasmid  DNA  levels  used  were  0,  1, 
2,  3,  or  4  pg  per  10cm  dish  of  cells).  Results  shown  are  the  average  of  three  separate 
transfections.  Error  bars  indicate  one  standard  deviation.  (B)  Diagram  of  the  reporter 
plasmid  used  in  this  experiment.  CD44  variable  exons  4  and  5  along  with  surrounding 
intron  sequences  were  inserted  into  the  first  intron  of  the  human  P-globin  gene  driven  by 
the  CMV  promoter.  Products  resulting  from  exclusion  (left)  or  inclusion  (right)  of  both 
variable  exons  are  indicated.  (C)  Results  of  one  of  the  transfections  quantified  in  (A). 
Total  RNA  from  a  transfection  was  subjected  to  RT/PCR  amplification  using 
radiolabeled  primers  specific  for  P-globin  exons  1  and  2.  These  primers  do  not  amplify 
any  RNA  from  untransfected  cells  (data  not  shown).  Amplification  products  arising  from 
RNA  including  both  one  or  none  of  the  CD44  exons  are  indicated  to  the  right  of  the  gel. 
The  extra  bands  in  the  Tra2a  lanes  result  from  cryptic  splicing  within  the  P-globin 
sequences.  Transfections  used  increasing  amounts  (0,  lanes  1  and  7;  1,  lanes  2  and  8;  2, 
lanes  3  and  9;  3,  lanes  4  and  10;  or  4,  lanes  5  and  1 1  pg)  of  indicated  expression  plasmid. 
Lane  6  is  a  size  marker. 
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Figure  3.  Tra2a  does  not  increase  inclusion  of  a  weak  p-globin  exon.  (A)  Quantification 
of  a  co-transfection  using  the  reporter  plasmid  diagrammed  in  B  and  an  expression  vector 
coding  for  human  Tra2oc.  (B)  The  reporter  plasmid  is  Dup33  which  includes  a  weakened 
internal  exon  derived  from  |3-globin  sequences.  The  first  and  third  exons  in  this  construct 
and  their  flanking  splice  sites  are  identical  to  the  mini-gene  used  in  Figure  2.  (C) 

RT/PCR  amplification  results  from  the  transfection  quantified  in  (A).  Product  RNAs 
resulting  from  inclusion  or  exclusion  of  the  central  exon  are  indicated. 

Figure  4.  Drosophila  Tra  has  little  effect  on  CD44  expression.  Co-transfections  used  the 
CD44  reporter  gene  diagrammed  in  Figure  2  and  expression  plasmids  coding  for  human 
Tra2a  and  D.  melanogaster  Tra.  (A)  Quantification  of  the  amount  of  inclusion  of  CD44 
variable  exons  4  and  5.  (B)  RT/PCR  analysis  of  RNA  products.  Products  resulting  from 
the  inclusion  of  no,  one,  or  two  CD44  exons  are  indicated.  Each  transfection  used 
increasing  amounts  of  DNA  (0,  1,  2,  3,  or  4  pg  plasmid).  The  transfection  in  the  middle 
used  increasing  Tra2a  in  the  presence  of  a  constant  amount  (2  pg)  of  Tra. 

Figure  5.  CD44  inclusion  is  also  affected  by  increasing  the  level  of  SRp75  and  SRp20. 
Co-transfections  used  the  CD44  reporter  gene  diagrammed  in  Figure  2  and  expression 
plasmids  coding  for  human  SRp75,  SRp55,  SRp20,  or  ASF/SF2.  (A)  Quantification  of 
the  amount  of  RNA  resulting  from  inclusion  of  CD44  variable  exons  4  and  5.  (B) 
RT/PCR  analysis  of  RNA  products  from  the  transfections  with  SRp75  (left)  and  SRp20 
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(right).  Products  resulting  from  the  inclusion  of  no,  one,  or  two  CD44  exons  are 
indicated. 

Figure  6.  Tra2  levels  increase  during  tumorigenesis.  RT/PCR  was  used  to  quantify  the 
levels  of  murine  Tra2a  and  Tra2(3  during  tumorigenesis  in  a  mouse  model  of  mammary 
cancer.  Total  RNA  was  extracted  from  the  indicated  tissues  and  cell  lines  and  subjected 
to  RT/PCR  amplification  using  primers  specific  to  Tra2a  (A)  or  Tra2p  (B). 
Amplification  of  a  control  gene  (hnRNP  Al)  was  used  to  demonstrate  equal  RNA 
isolation  from  each  sample  (data  not  shown). 

Figure  7.  Model  comparing  the  factors  binding  to  the  D.  melanogaster  CA-rich  exon 
enhancer  and  that  found  within  the  CD44  variable  exon  4. 
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Variable  Exon  4 

TCAATCATCGTTATCACAG 


tttcaaccacaccacgggcttttgaccac_a 


Caaaacagaaccaggactggacccagtggaacccaagccattcaaatccg 


aaaatqctacttcaqaCaaCCacaaggatgactg 


GTAATGGGTTCTGCAT 

5’ 


C 


Variable  Exon  5  y 


tttaaccatcatcacag 


cagatgtagacagaaatggcaccactgcttat 


gaaggaaac tggaacccagaagcacaccc tcccc tea tt caeca tgageatea 


t.qa.qrgaagraagagaccccacattctacaagcacaa 


GTAAGCAAGATGG 

5’ 


D  CA-rich  Repeats  in  CD44  Variable  Exons  4  &  5 


CAATCAACATT 


D.  melanogaster  doublesex  repeat 


CAACCA  CAcca 
gACCA  CAcaa 
CAACCA  CAa 
CAAgCA  CAa 
CAAtCATCgTT 
tAACCATCATc 


CD44  exon  4  sequence  1 
CD44  exon  4  sequence  2 
CD44  exon  4  sequence  3 
CD44  exon  5  sequence  1 
CD44  exon  4  3’  splice  site 
CD44  exon  5  3’  splice  site 


CAACCA  CA 


CD44  Consensus 
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Splicing  of  Dup  33 


Cell  line  made  from  preneoplasia 
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Using  a  mouse  model  of  mammary  gland  development 
and  tumorigenesis  we  examined  changes  in  both 
alternative  splicing  and  splicing  factors  in  multiple  stages 
of  mammary  cancer.  The  emphasis  was  on  the  SR  family 
of  splicing  factors  known  to  influence  alternative  splicing 
in  a  wide  variety  of  genes,  and  on  alternative  splicing  of 
the  pre-mRNA  encoding  CD44,  for  which  alternative 
splicing  has  been  implicated  as  important  in  a  number  of 
human  cancers,  including  breast  cancer.  We  observed 
step-wise  increases  in  expression  of  individual  SR 
proteins  and  alternative  splicing  of  CD44  mRNA  during 
mammary  gland  tumorigenesis.  Individual  preneoplasias 
differed  as  to  their  expression  patterns  for  SR  proteins, 
often  expressing  only  a  sub-set  of  the  family.  In  contrast, 
tumors  demonstrated  a  complex  pattern  of  SR  expres¬ 
sion.  Little  difference  was  observed  between  neoplasias 
and  their  metastases.  Alternative  splicing  of  CD44  also 
changed  through  the  disease  paradigm  such  that  tumors 
produced  RNA  containing  a  mixture  of  variable  exons, 
whereas  preneoplasias  exhibited  a  more  restricted  exon 
inclusion  pattern.  In  contrast,  other  standard  splicing 
factors  changed  little  in  either  concentration  or  splicing 
pattern  in  the  same  cells.  These  data  suggest  alterations 
in  relative  concentrations  of  specific  splicing  factors 
during  early  preneoplasia  that  become  more  pronounced 
during  tumor  formation.  Given  the  ability  of  SR  proteins 
to  affect  alternative  processing  decisions,  our  results 
suggest  that  a  number  of  pre-mRNAs  may  undergo 
changes  in  alternative  splicing  during  the  early  and 
intermediate  stages  of  mammary  cancer. 

Keywords:  SR  splicing  factors;  alternative  splicing; 
CD44;  mammary  tumorigenesis;  mouse  model 


Introduction 

Alternate  pre-mRNA  processing  contributes  signifi¬ 
cantly  to  the  developmental  regulation  of  gene 
expression  in  humans.  It  is  estimated  that  as  many  as 
25%  of  human  genes  utilize  alternative  RNA 
processing  to  produce  subtly  or  grossly  altered  gene 
products  (Moore  et  al,  1993;  Rio,  1993;  Norton,  1994; 
Berget,  1995;  Kramer,  1996).  During  the  last  year, 
reports  have  emerged  indicating  that  cancerogenesis 
induces  changes  in  alternative  processing  (Lee  and 
Feinberg,  1997;  Silberstein  et  al.,  1997;  Zhu  et  al., 
1997).  Some  of  the  genes  targeted  for  these  changes  are 
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receptor  proteins  suggesting  that  the  induced  altera¬ 
tions  in  splicing  could  have  pronounced  consequences 
for  cellular  behavior.  Little,  however  is  known  about 
how  RNA  processing  factors  are  altered  during 
tumorigenesis  so  as  to  affect  processing  outcomes. 

The  arginine-serine-rich  (SR)  proteins  (Figure  la) 
constitute  a  family  of  essential  splicing  factors  (Krainer 
et  al.,  1990a;  Ge  et  al.,  1991;  Zahler  et  al.,  1992)  that 
recognize  both  splice  sites  and  exonic  splicing 
enhancers,  and  influence  alternative  processing  deci¬ 
sions  when  their  relative  concentrations  are  altered  in 
vivo  or  in  vitro  (Ge  and  Manley,  1990;  Krainer  et  al., 
1990b;  Zahler  et  al.,  1993a;  Caceres  et  al.,  1994;  Wang 
and  Manley,  1995).  SR  proteins  have  been  observed  to 
influence  splicing  activity  via  their  binding  to  both 
splice  sites  and  special  splicing  accessory  sequences 
known  as  enhancers  (Zahler  et  al.,  1993b;  Fu,  1995; 
Manley  and  Tacke,  1996;  Valcarcel  and  Green,  1996). 
Recent  studies  have  indicated  substrate-specific  binding 
and  activity  for  individual  SR  proteins  (Fu,  1993;  Sun 
et  al.,  1993;  Zahler  et  al.,  1993a;  Caceres  et  al,  1994; 
Wang  and  Manley,  1995;  Chandler  et  al.,  1997;  Liu  et 
al. ,  1998).  Furthermore,  individual  SR  proteins  have 
distinct  tissue  distributions  (Zahler  et  al,  1992; 
Screaton  et  al.,  1995;  Caceres  and  Krainer,  1997). 
These  observations  have  led  to  suggestions  that 
alterations  in  the  levels  of  SR  proteins  could  be 
determinative  for  alternative  splicing  during  develop¬ 
ment.  This  class  of  proteins,  therefore,  become 
attractive  candidates  for  factors  whose  activity 
changes  during  tumorigenesis. 

An  ideal  system  in  which  to  study  the  relationship 
between  processing  and  the  progression  of  cancer 
would  be  one  in  which  both  normal  and  abnormal 
development  could  be  compared  and  studied  in  an 
isogenic  and  manipulatable  background.  Such  a 
system  is  available  in  an  established  in  vivo  mouse 
model  of  mammary  tumorigenesis  (Kittrell  et  al, 
1992;  Medina,  1996)  that  provides  access  to  normal 
mammary  tissues  from  mature  virgin,  pregnant  and 
lactating  females,  as  well  as  an  extensively  character¬ 
ized  set  of  preneoplasias  and  their  corresponding 
adenocarcinomas  and  metastases.  In  this  model 
system,  preneoplastic  outgrowth  lines  are  serially 
transplanted  and  maintained  for  extended  periods  of 
time  (up  to  12  months)  in  the  mammary  fat  pads  of 
syngenic  female  BALB/c  mice.  Each  outgrowth  line  is 
characterized  by  a  specific  rate  for  development  of 
adenocarcinomas  and  subsequent  metastases  (Kittrell 
et  al,  1992;  Medina,  1996).  The  outgrowths  are  clonal 
cell  populations  as  determined  by  oncogene  analysis 
(Cardiff,  1988;  Jerry  et  al,  1993). 

Using  this  system,  we  characterized  SR  protein 
expression  during  tumorigenesis.  At  the  same  time  we 
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Figure  1  SR  proteins  and  CD44  architecture,  (a)  Schematic  of  a  generic  SR  protein  with  two  N-terminal  RRM  (RNA  recognition 
motif)  motifs  that  bind  RNA  (hatched  boxes)  and  extensive  C-terminal  RS  (arginine-serine)  domains  (vertical  lines).  Thickness  of 
vertical  lines  represents  numbers  of  consecutive  RS  dipeptides,  (b)  CD44  exon  structure.  Constitutive  exons  1-5  and  16-20  flank  a 
central  region  encoding  at  least  ten  alternatively  included  (variable)  exons  (vl-vlO).  The  size  of  each  variable  exon  is  indicated 
(drawing  not  to  scale).  Arrows  indicate  primers  used  for  RT-PCR  analysis 


examined  the  alternative  splicing  of  a  pre-mRNA,  that 
coding  for  CD44,  whose  splicing  had  been  reported  to 
be  determinative  for  metastasis  (Gunthert  et  al ,  1991). 
CD44,  is  a  widely  expressed  cell  adhesion  molecule  and 
trans-membrane  glycoprotein  that  mediates  a  variety  of 
cell-cell  and  cell  matrix  interactions  (Fox  et  al. ,  1994; 
Mackay  et  al,  1994;  Stamenkovic  et  al ,  1989).  A 
number  of  CD44  isoforms  are  created  via  alternative 
pre-mRNA  splicing.  Within  the  CD44  gene  is  an 
internal  cassette  of  ten  alternatively  spliced  exons,  all  in 
the  same  translational  reading  frame,  and  coding  for 
specific  extracellular  domains  of  the  CD44  protein 
(Screaton  et  al. ,  1992;  Figure  lb).  Combinations  of 
these  variable  exons  lead  to  a  variety  of  CD44  isoforms 
(Screaton  et  al. ,  1992;  Haynes  et  al. ,  1990;  Gunthert, 
1993).  Certain  isoforms,  especially  those  including 
variable  exons  v5,  6,  and  7,  have  been  implicated  in 
the  metastasis  of  several  malignancies  (East  and  Hart, 
1993;  Cannistra  et  al,  1995;  Stickeler  et  al .,  1997; 
Wielenga  et  al. ,  1993)  and  correlated  with  survival  in 
human  breast  cancer  (Joensuu  et  al. ,  1993;  Kaufmann 
et  al. ,  1995).  Recently  it  has  been  shown  that 
alternative  splicing  of  CD44  responds  to  signal 
transduction.  Treatment  of  T-lymphoma  cells  that 
normally  express  a  form  of  CD44  lacking  variable 
exons,  with  phorbol  esters,  concanavalin  A,  or  c -ras 
induces  inclusion  of  variable  exons  (Konig  et  al .,  1998). 
Thus,  alternative  splicing  of  CD44  may  be  involved  in 
malignant  transformation  of  tissues  during  tumorigen¬ 
esis,  and  CD44  joins  the  collection  of  genes  whose 
splicing  is  altered  during  cancer  (Lee  and  Feinberg, 
1997;  Zhu  et  al ,  1997;  Silberstein  et  al,  1997). 

Recently  it  has  been  shown  that  CD44  alternative 
splicing  may  involve  inclusion  of  blocks  of  alternative 
exons  rather  than  single  isolated  exons  (Bell  et  al. , 
1998).  As  the  metastatic  potential  of  CD44  has  been 
correlated  with  only  a  single  variable  exon,  it  becomes 
of  interest  to  see  if  the  pattern  of  CD44  variable 
expression  during  tumorigenesis  may  have  been  under¬ 
estimated  by  the  heavy  attention  directed  to  this  one 
exon. 


Here  we  report  that  step-wise  alterations  in  SR 
splicing  factor  levels  and  CD44  alternative  splicing 
accompanied  the  transition  from  normal  cells  through 
preneoplasia  and  mammary  adenocarcinoma,  but  not 
metastases.  Individual  preneoplasias  exhibited  differ¬ 
ences  in  their  SR  expression  pattern  and  limited  CD44 
alternative  splicing.  Alternative  splicing  of  a  control 
gene  was  not  affected  during  preneoplasia  or 
neoplasia,  suggesting  gene-specific  alterations  in 
RNA  processing  factors  during  early  tumorigenesis, 
rather  than  a  general  increase  in  the  generic  processing 
machinery.  Tumors  showed  high  levels  of  SR  proteins, 
including  proteins  never  expressed  in  virgin  mammary 
epithelia  and  high  levels  of  inclusion  of  a  variety  of 
CD44  alternative  exons.  Our  results  suggest  that 
alterations  in  alternative  processing  may  be  important 
features  of  early  and  intermediate  stages  of  mammary 
cancer. 


Results 

For  this  study,  a  number  of  mammary  preneoplasias 
and  tumors  were  utilized,  including  tumors  and 
metastases  derived  from  two  of  the  preneoplasias. 
The  tumorigenic  and  metastatic  properties  of  two  of 
the  preneoplastic  outgrowth  lines,  TM-2L  and  TM-40, 
are  summarized  in  Table  1.  Both  TM-2L  and  TM-40 
produce  type  B  mammary  adenocarcinomas  by  12 
months,  at  an  incidence  of  21  or  51%,  respectively. 
The  main  difference  between  the  two  lines  is  the 
significant  metastatic  growth,  primarily  in  lungs  and 
occasionally  in  liver,  of  TM-40  (Table  2). 

Stepwise  alterations  in  expression  of  SR  proteins  during 
development  of  mammary  cancer 

SR  proteins  are  essential  splicing  factors  that 
participate  in  early  events  of  substrate  recognition  by 
the  processing  machinery  (Fu,  1995;  Zahler  et  al , 
1993a, b;  Manley  and  Tacke,  1996;  Valcarcel  and 
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Table  1  Tumorigenic  characteristics  of  TM  preneoplastic  outgrowth 
lines 


Outgrowth 

Transplant a 

Tumors/ 

transplants 

te50 

% 

line 

generation 

(%) 

(  months  )b 

Metastases c 

TM2L 

21-28 

21/98  (21) 

9.5 

0 

TM40 

4-13 

80/157  (51) 

11.0 

73 

“Number  of  times  outgrowth  lines  were  removed  from  fat  pads  and 
retransplanted  into  cleared  fat  pads.  bTE50  refers  to  time  for  50%  of 
the  transplants  to  form  tumors.  cMetastases  were  detected  primarily 
in  lung  and  secondarily  in  liver  in  mice  6-8  weeks  after  surgical 
removal  of  the  primary  tumors  (11/15  animals).  Metastases  are  rarely 
detected  in  animals  bearing  primary  tumors 


Table  2  Preneoplastic  origin  of  adenocarcinomas  and  organ 
metastases 


Tumor /metastases 

Outgrowth  line 

T-5839 

TM-2L 

T-6788 

TM-40 

T-7780 

TM-40 

Liver  metastases  (T-7780) 

TM-40 

Lung  metastases  (T-7262) 

TM-40 

Green,  1996).  In  addition  they  bind  to  accessory 
splicing  enhancer  sequences  to  facilitate  splicing  of 
alternative  exons  (Ge  and  Manley,  1990;  Caceres  et  al. , 
1994;  Wang  and  Manley,  1995).  SR  proteins  are 
characterized  by  an  extensive  region  containing 
arginine  and  serine  (Figure  la)  which  is  extensively 
phosphorylated.  Within  this  phosphorylated  domain  is 
the  epitope  for  a  monoclonal  antibody,  mAbl04 
(Zahler  et  al.,  1993b;  Valcarcel  and  Green,  1996).  To 
examine  alterations  in  splicing  factors  during  develop¬ 
ment  of  mammary  cancer,  we  examined  the  expression 
of  SR  proteins  in  normal  and  neoplastic  mammary 
tissues  by  Western  blots  using  mAb  104.  In  HeLa  cells, 
mAbl04  recognizes  approximately  equal  amounts  of 
the  major  SR  proteins  (SRp20,  ASF/SF2,  SC35,  9G8, 
SRp30c,  SRp40,  SRp55,  and  SRp75  of  molecular 
weights  20-75  kDa,  Zahler  et  al. ,  1993b).  Mammary 
epithelial  cells  from  mature  virgin  glands  expressed  a 
sub-set  of  the  major  SR  proteins,  yielding  dominant 
expression  of  SRp75  and  SRp55  (Figure  2a,  lane  1). 
Three  of  the  preneoplasias,  TM-2L,  TM-4  and  TM-10, 
yielded  an  SR  expression  pattern  similar  to  that  of 
mature,  virgin  mammary  gland  (Figure  2a,  lanes  2,  4 
and  5).  The  other  tested  preneoplasias,  TM-3  and  TM- 
40,  in  addition,  demonstrated  detectable  levels  of 
SRp40  and  SR  proteins  of  the  30  kDa  class, 
suggesting  induction  of  synthesis  of  these  SR  proteins 
during  the  preneoplastic  process  (Figure  2a,  lanes  3 
and  6).  Tested  adenocarcinomas  and  their  metastases 
resembled  the  expression  pattern  of  HeLa  cells, 
characterized  by  expression  of  a  broad  spectrum  of 
SR  proteins  detected  by  mAbl04  (Figure  2a,  lanes  7-9 
and  data  not  shown).  We  have  tested  a  number  of 
independently  derived  neoplasia  for  their  SR  expres¬ 
sion  profile  (data  not  shown).  All  demonstrated  SR 
expression  patterns  identical  to  those  shown  in  Figure 
2a,  lanes  7-9,  indicating  that  SR  protein  expression  is 
induced  during  tumorigenesis  regardless  of  the  expres¬ 
sion  pattern  during  pre-neoplasia.  Thus,  the  pattern  of 
SR  expression  became  more  complex  as  neoplasia 
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Figure  2  The  expression  of  SR  proteins  is  altered  in  mammary 
adenocarcinomas  and  in  some  preneoplasias,  (a)  Western  blot 
analysis  of  SR  protein  expression  in  different  breast  tissues  using 
an  antibody  specific  for  a  phosphorylated  SR  epitope  (mAb  1 04). 
Lane  1,  mature,  virgin  gland;  lanes  2-6,  preneoplasias  TM-2L, 
TM-3,  TM-4,  TM-10  and  TM-40,  respectively;  lanes  7-9 
mammary  adenocarcinomas  T-5839,  T-7780  and  T-6129,  respec¬ 
tively.  Twice  as  much  protein  (100  pg)  was  analysed  for  normal 
and  preneoplastic  tissue  (lanes  1-6)  than  for  tumors  (50  ^g,  lanes 
7-9).  (b)  Western  blot  analysis  of  the  same  tissue  analysed  in  (a) 
for  the  presence  of  the  constitutive  splicing  factor  U2AF  using  a 
polyclonal  rabbit  antibody  raised  against  a  recombinant  fragment 
of  the  human  65  kDa  subunit  missing  the  SR  domain  (Zamore  et 
al .,  1992).  Lanes  as  in  (a),  (c)  Western  blot  analysis  of  selected 
tissues  from  (a)  using  an  antibody  specific  for  SRp20.  The  utilized 
antibody  (Neugebauer  and  Roth,  1997)  is  an  anti-peptide 
antibody  specific  for  the  region  of  SRp20  between  the  RNA 
binding  domain  and  the  SR  domain.  Lane  1,  mature  virgin  gland; 
lane  2,  preneoplasia  TM2L;  lane  3,  preneoplasia  TM40;  lane  4, 
mammary  adenocarcinoma  T-5839;  lane  5,  liver  metastasis,  (d) 
Alterations  in  levels  of  mRNAs  coding  for  splicing  factor  SRp20 
during  tumorigenesis.  Levels  of  SRp20  revealed  by  RT/PCR  using 
primers  specific  for  SRp20  (see  Materials  and  methods).  Equal 
amounts  (5  fig)  of  total  cell  RNA  were  used  for  each  lane.  Lanes 
as  in  (c).  See  Figure  3b  for  RT/PCR  analysis  of  a  control  RNA  in 
these  RNA  samples 


progressed  until  a  complex  array  of  SR  proteins  were 
expressed  in  tumor  cells. 

The  expression  of  several  control  RNA  binding 
factors  was  monitored  over  the  same  disease  paradigm. 
Individual  preneoplasias  showed  identical  levels  of  the 
65  kDa  subunit  of  U2AF  (Figure  2b),  hnRNP  Al 
(Figure  3b)  and  hnRNP  A2  (Figure  4),  as  well  as  small 
nuclear  ribonucleoproteins  (snRNPs)  (data  not  shown), 
indicating  that  there  was  not  a  general  increase  in  the 
expression  of  factors  involved  in  pre-mRNA  splicing 
during  preneoplasia.  Tumors  did  show  a  slight  increase 
(estimated  to  be  twofold)  for  many  factors  as 
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Figure  3  (a)  Alternative  splicing  of  CD44  changes  during 
mammary  development  and  neoplasia.  RT-PCR  analysis  of 
CD44  mRNA  isoforms  in  breast  tissue  using  primers  specific  for 
constitutive  exons  5  and  constitutive  exon  16.  Amplification  was 
for  25  cycles.  Boxes  symbolize  included  exons  in  the  amplification 
bands.  Exon  structures  of  amplifications  products  are  indicated. 
Hatching  indicates  mixtures  of  included  variable  exons.  Lane  1, 
lactating  gland;  lane  2,  pregnant  gland;  lane  3,  mature  virgin 
gland;  lane  4,  preneoplasia  TM-2L;  lane  5,  preneoplasia  TM-40; 
lanes  6-  10,  adenocarcinomas  T-4031,  T-4032,  T-5839,  T-6129,  T- 
7780,  respectively.  All  lanes  used  equal  amounts  (5  fig)  of  total 
cell  RNA  for  amplification,  (b)  RT/PCR  analysis  of  a  control 
RNA  coding  for  hnRNP  A1  mRNA.  Samples  are  identical  to 
those  in  (a).  The  form  of  hnRNP  AP  shown  is  that  coding  for  the 
version  of  the  mRNA  lacking  exon  7b 
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Figure  4  Alternative  splicing  of  mRNA  coding  for  hnRNP  A2 
does  not  change  through  mammary  development  or  neoplasia. 
Low  cycle  RT-PCR  of  hnRNP  A2  mRNA  in  mammary  tissues 
using  primers  specific  for  two  constitutive  exons  (white  boxes) 
flanking  a  36  nucleotide  alternatively  included  exon  (hatched 
box).  Exon  skipping  produces  a  135  nt  band  (bottom),  inclusion 
in  a  171  nt  amplification  product  (top).  Lane  1,  lactating  gland; 
lane  2,  pregnant  gland;  lane  3,  mature  virgin  gland;  lane  4, 
preneoplasia  TM-2L;  lane  5,  preneoplasia  TM-40;  lanes  6-10, 
adenocarcinomas  T-4031,  T-4032,  T-5839,  T-6129  and  T-7780, 
respectively.  Equal  amounts  of  total  cell  RNA  (5  fig )  were 
analysed 
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compared  to  the  preneoplasias,  including  the  SR 
proteins  SRp75  and  SRp55  (Figure  2a).  More 
interesting,  however,  was  the  pronounced  induction 
of  expression  of  the  lower  molecular  weight  SR 
proteins  in  the  20-40  kDa  size  range  during  neoplasia. 

One  of  the  SR  proteins  induced  upon  tumorigenesis 
was  SRp20.  This  protein  is  only  weakly  detected  by 
mAbl04.  By  Western  blotting,  this  protein  was 
essentially  undetectable  in  preneoplasias.  Neoplasias 
weakly  showed  production  of  SRp20.  To  better 
examine  induction  of  this  factor  we  used  an  antibody 
specific  for  SRp20.  This  antibody  was  raised  against  an 
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SRp20  peptide  from  the  region  of  the  protein  located 
between  the  RNA-binding  domain  and  the  phosphory- 
lated  SR  region  (Neugebauer  and  Roth,  1997).  As 
shown  in  Figure  2c,  SRp20  levels  detected  by  this 
antibody  resembled  those  detected  by  the  mAbl04 
antibody.  No  expression  of  SRp20  protein  was 
detected  in  virgin  or  preneoplastic  gland.  In  contrast, 

SRp20  was  expressed  in  tumor  tissue.  To  rule  out  the 
possibility  of  a  false  positive  resulting  from  the 
presence  of  endothelial  cells  or  infiltrating  inflamma¬ 
tory  cells  in  the  excised  adenocarcinomas,  we  analysed 
SR  protein  levels  in  cultured  mammary  epithelial  cells 
established  from  the  tumors  in  this  study.  In  parallel 
with  the  tumours,  each  cell  line  demonstrated  high 
levels  of  the  entire  family  of  SR  proteins  recognized  by 
mAbl04  (data  not  shown). 

To  provide  more  sensitive  detection  of  SRp20,  we 
monitored  SRp20  mRNA  levels  using  RT-PCR 
(Figure  2d).  To  control  for  non-linear  estimations  of 
mRNA  levels  using  RT  PCR,  SRp20  mRNA  levels 
were  also  determined  by  RNase  protection  (data  not 
shown).  As  observed  by  analysis  with  the  Western 
blotting,  tumorigenesis  was  accompanied  by  a  major 
increase  in  the  levels  of  SRp20  mRNAs,  indicating  that 
the  increased  levels  of  SRp20  in  tumors  seen  with 
Western  blotting  reflected  increased  mRNA  synthesis 
rather  than  increased  phosphorylation  of  an  existing 
protein  population  (Figure  2d). 

RT-PCR  revealed  the  presence  of  a  low  level  of 
the  tested  SRp20  mRNAs  in  the  TM-40  preneoplasia 
that  had  not  been  detected  by  Western  blotting, 
suggesting  that  low  level  synthesis  of  SRp20  occurs 
in  the  TM-40  preneoplasia.  Even  using  the  power  of 
PCR  amplification,  however,  no  SRp20  was  detected 
in  preneoplasia  TM-2L  or  in  mature  virgin  tissue. 

These  results  indicate  that  SRp20  is  induced  during 
development  of  mammary  carcinogenesis.  Further¬ 
more,  it  indicates  a  progressive  increase  in  expression 
as  normal  cells  alter  to  preneoplasias  and  then  to 
tumors. 


The  splicing  of  CD44  changes  during  progression  of 
mammary  cancer 

Alteration  in  the  relative  levels  of  members  of  the  SR 
proteins  recognized  by  mAbl04  is  known  to  affect 
splicing  phenotypes.  Thus,  the  changes  in  overall 
amount  and,  more  importantly,  in  relative  amounts 
of  the  SR  proteins  displayed  in  Figure  2  should  be 
accompanied  by  changes  in  alternative  processing  of 
target  genes.  To  examine  this  possibility  we  looked  at 
the  splicing  of  CD44,  a  gene  with  multiple  alternative 
exons.  Some  of  these  exons  have  sequences  similar  to 
the  purine-rich  splicing  enhancers  known  to  be  the 
binding  sites  for  the  mAB  104-reactive  SR  proteins 
(Black,  1995;  Manley  and  Tacke,  1996;  Reed,  1996; 
Rio,  1993;  Bell  et  al ,  1998;  Liu  et  ai,  1998).  Splicing  of 
CD44  was  monitored  by  RT-PCR  analysis  of  RNA 
from  the  same  tissues  used  to  examine  SR  proteins. 
CD44  mRNA  is  expressed  as  two  basic  forms,  with  or 
without  one  or  more  of  the  alternative  internal  exons. 
Expression  of  CD44  standard  lacking  all  variable  exons 
(CD44  std)  was  examined  using  primers  complemen¬ 
tary  to  constitutive  exons  5  and  16,  which  border  the 
central  region  of  the  CD44  gene  encoding  the  ten 
alternative  exons  (Figure  lb).  RT-PCR  of  CD44 


Changes  in  SR  splicing  factors  in  mammary  tumorigenesis 

E  Stickeler  ef  a/ 

3578 

standard  mRNA  should  produce  an  amplification  band 
of  221  nucleotides;  inclusion  of  one  or  more  variable 
exons  should  produce  larger  bands.  Because  all  of  the 
variable  exons  are  approximately  the  same  size  (Figure 
lb)  inclusion  of  multiple  variable  exons  produces  an 
amplification  pattern  with  a  ladder  of  bands  differing 
in  length  by  the  average  variable  exon  length 
(approximately  115  nucleotides).  This  approach  per¬ 
mits  evaluation  of  the  overall  amount  of  CD44  mRNA 
present  in  examined  tissues  as  well  as  an  estimation  of 
the  extent  of  alternative  splicing.  Amplification 
revealed  that  mRNA  encoding  for  CD44  std,  lacking 
the  variable  exons,  was  expressed  in  all  tissues 
examined  (Figure  3).  Tumor  RNA  from  adenocarci¬ 
nomas  or  their  metastases  demonstrated  noticeable 
alternative  processing  to  reveal  a  family  of  amplifica¬ 
tion  bands  resulting  from  the  inclusion  of  multiple 
variable  exons. 

To  distinguish  CD44-specific  effects  on  alternative 
splicing  associated  with  neoplastic  transformation  from 
generic  effects  on  splicing  activity  associated  with  an 
accelerated  growth  rate,  we  examined  the  amounts  and 
spliced  isoforms  of  an  RNA  tfrat  undergoes  stochastic 
alternative  splicing  in  many  tissues.  The  hnRNP  A2 
gene  is  processed  to  produce  two  mRNA  isoforms 
which  alternatively  include  a  36  nucleotide  variable 
exon.  We  observed  relatively  equal  levels  of  total 
hnRNPA2  mRNA  in  all  tissues  examined  and 
approximately  equal  relative  levels  of  inclusion  of  the 
variable  exon  (Figure  4).  This  result  suggests  that  the 
changes  in  CD44  splicing  occurring  in  mammary 
neoplasia  is  due  to  a  change  in  the  sub-set  of  splicing 
factors  required  to  recognize  the  CD44  alternative 
exons  and  not  a  change  in  the  generic  splicing 
machinery. 

CD44  metastatic  exons  are  expressed  at  different  levels 
during  normal  mammary  gland  development  and 
tumorigenesis 

Low-cycle  RT-PCR  with  primers  specific  for  consti¬ 
tutive  exons  5  and  16  produces  complicated  amplifica¬ 
tion  patterns  because  of  the  potential  complexity  of  the 
family  of  CD44  mRNA  sequences.  To  better  examine 
variable  exons  splicing,  RNAs  from  tissue  samples 
were  analysed  by  RT-PCR  using  exon-specific  primers 
for  variable  exons  5,  6,  7  or  8  and  a  consistent  primer 
for  constitutive  exon  5  (Figure  5a -d),  followed  by 
cloning  and  subsequent  sequencing  of  the  PCR 
products.  This  approach  allowed  examination  of  the 
inclusion  of  all  alternative  exons  in  the  interval  vl-v8. 
Both  low  and  high  cycle  amplifications  were  per¬ 
formed.  We  concentrated  on  analysis  of  species 
containing  variable  exons  v5,  6  or  7,  because  these 
exons  have  been  associated  with  metastasis  in  other 
studies  (Stickeler  et  at.,  1997;  Heider  et  at.,  1995;  Sinn 
et  ah ,  1995).  Variable  exon  v8  was  also  of  interest 
because  it  is  one  of  three  so-called  ‘epithelial  ‘  exons, 
included  in  CD44  mRNA  in  a  variety  of  epithelial 
cells. 

Low  cycle  RT-PCR  analysis  of  RNA  from  multiple 
tumors  or  their  metastases  using  a  primer  specific  for 
variable  exon  5  (v5)  indicated  considerable  production 
of  RNAs  containing  v5  with  or  without  other  upstream 
variable  exons  (Figure  5a  and  data  not  shown).  The 
inclusion  patterns  for  all  of  the  examined  tumors  and 
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metastases  were  very  similar,  and  indicated  that  most 
of  the  observed  RNA  included  multiple  variable  exons 
from  the  vl-v5  region.  In  contrast,  levels  of  inclusion 
of  v5  were  lower  in  normal  dividing  cells  from 
pregnant  or  lactating  mammary  gland  and  in  the  two 
tested  preneoplasias.  Using  25-35  cycles  of  RT-PCR 
(Figure  5a  and  b),  the  major  amplified  species  in  these 
tissues  included  either  v5  alone  or  v5  and  one  other 
upstream  variable  exon.  Cloning  and  sequencing  of  the 
larger  amplification  band  indicated  that  4/5  sequenced 
clones  contained  v4  in  addition  to  v5.  This  species  was 
more  obvious  in  pregnant  and  lactating  RNA  than  in 
RNA  from  the  preneoplasias  (Figure  5a).  We  were 
unable  to  detect  any  RNA  containing  v5  in  RNA 
isolated  from  mature  virgin  mammary  gland  (Figure 
5a,  lane  3;  and  5b,  lane  2).  Because  of  the  abundance 
of  fat  tissue  in  mature  virgin  gland,  we  isolated 
epithelial  cells  from  the  extracted  virgin  mammary 
tissue.  High  cycle  amplification  of  this  RNA  yielded  no 
demonstrable  inclusion  of  exon  v5  either  (data  not 
shown).  Therefore,  for  v5  splicing,  three  patterns  of 
inclusion  were  observed  by  RT-PCR  and  sequencing: 
high  level  of  inclusion  of  multiple  exons  along  with  v5 
in  tumor  cells,  low  level  of  inclusion  of  v5  or  v5  +  4  in 
preneoplasias,  and  no  inclusion  of  v5  in  normal  virgin 
mammary  gland.  This  pattern  suggests  step-wise 
alterations  in  CD44  alternative  splicing  during 
neoplasia.  In  addition,  our  data  suggest  that  preneo¬ 
plasias  adopt  a  splicing  pattern  similar  to  that  induced 
in  mammary  epithelia  during  normal  mammary 
development  and  that  tumorigenesis  is  accompanied 
by  induction  of  a  pattern  of  frequent  alternative 
splicing  not  found  in  normal  development. 

Analysis  of  alternative  splicing  of  v6  or  7  gave 
results  similar  to  that  discussed  above  for  v5  (Figure  5c 
and  data  not  shown).  Inclusion  of  both  exons  occurred 
at  high  frequency  along  with  other  upstream  exons  in 
tumor  cell  RNA.  RNA  from  preneoplasias  and 
pregnant  gland  demonstrated  simpler  RNAs  contain¬ 
ing  fewer  variable  exons.  Sequencing  of  PCR  products 
indicated  that  for  v6,  these  RNAs  contained  only  v6. 
For  v7,  species  containing  v7  or  v7  +  4  were  observed. 
In  contrast  to  v5,  both  v6  and  7  were  included  in  RNA 
from  mature  virgin  gland  epithelial  cells  (Figure  5c, 
lane  5;  and  data  not  shown).  Like  v5,  these 
observations  suggest  that  splicing  in  preneoplasias  is 
similar  to  splicing  in  normal  proliferative  mammary 
gland  and  that  the  conversion  to  neoplasia  is 
accompanied  by  pronounced  increased  in  mRNA 
including  these  variable  exons.  Therefore,  for  this 
system  of  mammary  neoplasia,  inclusion  of  v6  is  not 
a  marker  for  either  tumorigenesis  or  metastasis. 
Instead,  a  better  marker  is  increased  inclusion  of  a 
number  of  CD44  variable  exons  including  v6. 

A  recent  report  (Bell  et  ah ,  1998)  indicated  that 
inclusion  of  blocks  of  variable  CD44  exons  was  a 
frequent  phenotype  in  mice.  Such  multiple  inclusion 
events  are  certainly  consistent  with  the  patterns  we 
observed  in  the  tumors.  Preneoplasias  and  mature 
virgin  mammary  epithelial  cells,  however,  demon¬ 
strated  inclusion  of  individual  exons  without  inclusion 
of  an  immediate  neighbor.  For  example,  v6  was 
included  without  v5  in  mature  gland  and  preneopla¬ 
sias  and  v7  was  included  with  v4  but  not  v6  and  5.  If 
inclusion  of  multiple  exons  reflects  a  mechanism 
change  with  respect  to  inclusion  of  single  exons,  then 
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Figure  5  CD44  metastatic  exons  are  expressed  in  normal  development  and  tumorigenesis.  RT-PCR  analysis  of  CD44  mRNA 
variable  isoforms  using  primers  specific  for  constitutive  exons  5  and  variable  exon  v5  (a  and  b),  variable  exon  v6  (c)  or  variable 
exon  v8  (d).  Amplification  was  for  25  (a,  c,  d)  or  35  (b)  cycles.  Boxes  symbolize  included  exons  in  the  amplification  bands.  Exon 
structures  of  amplifications  products  are  indicated.  Hatching  indicates  mixtures  of  included  variable  exons.  Majority  products  are 
indicated  where  they  existed,  (a,  c)  Lane  1,  lactating  gland;  lane  2,  pregnant  gland;  lane  3,  mature  virgin  gland;  lane  4,  preneoplasia 
TM-2L;  lane  5,  preneoplasia  TM-40;  lanes  6-10,  adenocarcinomas  T-4031,  T-4032,  T-5839,  T-6129  and  T-7780,  respectively,  (b) 
Lane  1,  pregnant  gland;  lane  2,  mature  virgin  gland;  lane  3,  preneoplasia  TM-2L;  lane  4,  preneoplasia  TM-40.  (d)  Lane  1,  lactating 
gland;  lane  2,  mature  virgin  gland;  lane  3,  preneoplasia  TM-2L;  lane  4,  preneoplasia  TM-40;  lanes  5-6,  adenocarcinomas  T-4031 
and  T-4032.  All  lanes  used  equal  amounts  (5  jug )  of  total  cell  RNA  for  amplification 


tumorigenesis  is  accompanied  by  this  mechanism 
change. 

Analysis  of  RNA  for  the  presence  of  v8  yielded  a 
surprising  result.  Like  the  other  variable  exons,  v8  was 
included  in  a  number  of  RNA  species  in  mammary 
tumor  cells  (Figure  5d,  lanes  5-6).  Unlike  v5-7, 
however,  no  v8  inclusion  could  be  observed  in 
preneoplasias  or  RNA  from  developing  mammary, 
despite  their  epithelial  origin  (Figure  5d,  lanes  3-4). 
The  observation  of  induction  of  v8  splicing  in  tumor 
RNA  indicates  that  tumorigenesis  is  accompanied  by 
alterations  in  RNA  processing  of  CD44  that  are  never 
seen  in  normal  development. 


Discussion 

During  the  last  several  years,  studies  have  suggested 
changes  in  pre-mRNA  splicing  in  human  malignancies 
(Lee  and  Feinberg,  1997;  Zhu  et  al,  1997;  Silberstein  et 
al .,  1997).  Using  an  in  vivo  model  of  mammary 
development  and  tumorigenesis,  we  were  able  to  study 
these  questions  with  a  focus  on  comparing  splicing 
phenotypes  and  factors  during  normal  mammary 
development  to  those  occurring  in  development  of 
mammary  cancer.  Neoplasia  was  accompanied  by  a 
dramatic  increase  in  expression  of  some  of  the  SR 
family  of  splicing  factors  recognized  by  mAbl04, 
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resulting  in  alteration  of  the  relative  abundance  of 
individual  SR  proteins  and  an  increase  in  the 
complexity  of  expression  of  this  important  class  of 
splicing  factors.  Given  the  number  of  genes  whose 
splicing  has  been  observed  to  be  responsive  to  relative 
SR  protein  levels  (Ge  and  Manley,  1990;  Krainer  et  al , 
1990b;  Caceres  et  al.,  1994;  Wang  and  Manley,  1995), 
this  observation  suggests  that  pronounced  changes  in 
alternative  splicing  of  a  number  of  pre-mRNAs  should 
accompany  mammary  tumorigenesis. 

The  utilized  model  system  also  permitted  examina¬ 
tion  of  SR  protein  expression  in  preneoplasias  with 
differing  capacity  to  form  tumors  or  for  those  tumors 
to  metastasize.  The  preneoplasias  characterized  in  this 
study  differed  in  their  SR  protein  expression  pattern. 
Some  of  the  neoplasias  had  mAb  104-reactive  SR 
expression  patterns  similar  to  that  seen  with  virgin 
mammary  tissue.  The  tested  preneoplasias  therefore 
appeared  to  represent  different  stages  in  the  transition 
from  normal  to  tumor  tissue  with  respect  to  SR  protein 
expression.  The  utilized  preneoplasias  arose  indepen¬ 
dently  and  have  different  tumor  and  metastasis 
producing  capabilities.  SR  expression  levels  did  not 
correlate  to  tumor  or  metastatic  incidence  frequency. 
Thus,  although  SR  expression  changes  may  be  a 
marker  for  preneoplasia,  they  are  not  predictive  for 
tumor  incidence  or  invasiveness. 

Analysis  of  other  constitutive  splicing  factors 
indicated  that  the  changes  in  SR  expression  we 
observed  were  not  the  result  of  induction  of  all 
splicing  during  neoplasia.  Levels  of  the  constitutive 
factors  U2AF65,  two  hnRNP  proteins,  and  U  snRNPs 
were  only  modestly  altered  during  preneoplasia  and 
neoplasia.  Therefore,  the  alterations  we  observed 
occurred  in  a  family  of  factors  associated  with 
alternative  splicing  and  the  recognition  of  exon 
enhancer  sequences.  This  observation  suggests  that 
the  splicing  of  multiple  target  genes  regulated  by  SR 
proteins  would  be  expected  to  be  altered  in  neoplasia, 
without  a  general  effect  on  exon  inclusion  levels. 

We  also  used  this  model  to  investigate  potential 
changes  in  alternative  splicing  in  a  gene  whose 
splicing  changes  during  cancer  have  been  correlated 
to  metastatic  potential  in  human  cancers  and  which 
has  recently  been  shown  to  alter  its  splicing  pattern 
in  response  to  signal  transduction  (Konig  et  al., 
1998).  As  with  SR  proteins,  striking  increases  in 
alternative  splicing  of  CD44  were  observed  in 
adenocarcinomas  as  compared  to  either  preneoplasias 
or  proliferative  mammary  epithelia  from  pregnant 
animals.  All  of  the  tumors  examined  displayed  a 
similar  pattern  of  exon  inclusion,  in  which  multiple 
alternative  exons  were  included  in  mRNA.  Included 
in  this  set  of  exons  were  exons  not  normally  present 
in  epithelia  of  pregnant  mammary  gland  or  in 
preneoplasias.  The  splicing  pattern  of  the  adenocarci¬ 
nomas  was  similar  to  that  of  the  lung  and  liver 
metastases  of  these  tumors,  indicating  few  changes 
accompanying  metastasis.  In  contrast  to  CD44 
alternative  splicing,  the  splicing  of  a  control  RNA 
did  not  change  during  the  transition  from  preneopla¬ 
sia  to  neoplasia.  This  difference  suggests  that  the 
pronounced  changes  in  splicing  of  CD44  observed 
during  development  of  neoplasia  arise  from  the 
induction  of  specialized  splicing  factors,  not  an 
increase  in  the  generic  splicing  machinery. 


Although  pronounced  changes  in  CD44  splicing 
correlated  with  alterations  in  the  expression  level  of 
SR  proteins  during  the  tumorigenesis  paradigm  used  in 
this  study,  it  is  not  yet  clear  if  the  two  are  related. 
Transfection  of  cells  that  normally  skip  CD44  variable 
exons  4  and  5  with  cDNAs  for  SRp55  and  ASF/SF2 
had  little  effect  on  exon  inclusion  in  a  co-transfected 
mini-gene  containing  these  two  exons  (data  not 
shown).  In  contrast,  transfection  with  cDNAs  for 
both  SC35  and  tra2  did  alter  the  percentage  of  exon 
inclusion  (Stickeler  and  Berget,  manuscript  in  prepara¬ 
tion).  Therefore,  the  splicing  of  CD44  may  be  regulated 
by  a  sub-set  of  the  SR  family  of  splicing  factors. 

When  we  examined  CD44  variable  exon  splicing  in 
epithelia  from  pregnant  mammary  gland,  we  observed 
low  levels  of  inclusion  of  some  but  not  all  the  variable 
exons.  In  particular,  v6  and  7  were  present  in  a  sub¬ 
population  of  CD44  mRNA  from  both  pregnant  tissue 
and  preneoplasias,  indicating  that  these  so-called 
‘metastatic’  exons  are  included  in  non-cancerous 
mammary  epithelial  cells.  In  contrast,  one  variable 
exons,  v5,  was  not  expressed  in  virgin  mammary  gland, 
indicating  that  alterations  in  CD44  alternative  splicing 
can  occur  during  preneoplasia  and  resemble  those 
occurring  during  pregnancy.  Other  variable  exons  that 
are  not  included  in  pregnant  tissue  (such  as  the 
‘epithelial’  exon  v8)  are  also  not  expressed  during 
preneoplasia,  but  are  included  in  tumor  RNA.  There¬ 
fore,  our  analysis  suggests  that  preneoplasias  demon¬ 
strate  a  splicing  pattern  resembling  that  occurring 
during  normal  breast  development.  Known  to  partici¬ 
pate  in  cell-cell  interactions  and  cell-matrix  interac¬ 
tions,  CD44  isoforms  could  be  involved  in  the  ductal 
outgrowth  occurring  during  early  pregnancy. 

Inclusion  of  three  of  the  CD44  variable  exons  (v5,  6 
and  7)  have  been  postulated  to  be  markers  for 
metastasis  potential  in  several  human  cancers,  includ¬ 
ing  breast  cancer.  The  initial  interest  in  these  exons 
arose  from  an  elegant  experiment  demonstrating  that 
transfection  of  a  CD44  cDNA  including  exon  v6  into  a 
tumor  cell  of  no  metastatic  potential  increased  the  rate 
of  metastasis  when  these  cells  were  placed  back  into 
syngenic  animals.  Such  experiments  expressed  the 
variant  CD44  mRNA  at  high  levels.  Our  analysis 
would  suggest  that  the  major  change  in  CD44  splicing 
occurs  during  development  of  adenocarcinomas,  not 
their  metastases  and  that  it  may  be  the  over-expression 
of  variant  CD44  mRNAs  during  the  neoplastic  state 
which  is  important  for  the  metastatic  process,  rather 
than  the  production  of  any  mRNA  including  v6.  This 
difference  may  suggest  why  experiments  monitoring 
inclusion  of  CD44  variable  exons  have  failed  to 
demonstrate  a  consistent  correlation  to  disease  pro¬ 
gression  in  several  clinical  studies  (Friedrichs  et  al., 
1995;  Muller  et  ah,  1997;  Tran  et  al,  1997). 

The  recent  report  of  a  coupling  between  signal 
transduction  and  alternative  splicing  of  CD44  concen¬ 
trated  on  variable  exon  v5.  Inclusion  of  this  exon 
showed  a  pronounced  response  to  induction  by  c -ras, 
TPA,  or  phorbol  esters  (Konig  et  al.,  1998).  Exon  v5 
contains  internal  sequences  that  resemble  known 
purine-rich  exon  enhancers  that  bind  SR  proteins. 
These  observations  suggest  that  signal  transduction 
cascades  can  lead  to  an  induction  of  SR  proteins 
needed  for  CD44  variable  splicing.  In  light  of  these 
findings,  our  observation  that  inclusion  of  v5  is 
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changed  upon  preneoplasia  supports  a  possible 
connection  between  SR  protein  levels  and  CD44 
alternative  splicing. 

Besides  CD44  a  number  of  other  genes  show  altered 
RNA  processing  during  tumorigenesis.  A  number  of 
cellular  receptors  and  hormone  genes  undergo  alter¬ 
native  processing  and  this  processing  appears  to  alter 
during  cancer.  Because  the  changes  in  these  proteins 
caused  by  alternative  processing  can  have  a  pro¬ 
nounced  effect  on  cellular  function,  it  is  important  to 
understand  the  role  of  alternative  processing  in  cancer 
and  the  mechanisms  involved.  Determination  of 
splicing  factor  constellations  patterns  in  preneoplastic 
lesions  of  the  mammary  gland  could  be  very  helpful  to 
identify  patients  in  high-risk  situations  to  develop 
invasive  breast  cancer  and  subsequent  organ  metas¬ 
tasis.  Distinct  expression  patterns  of  splicing  factors 
could  serve  as  new  markers  for  metastasis  in  breast 
cancer,  and  splicing  factors  may  represent  targets  for 
intervention  in  a  subgroup  of  patients  abnormally 
expressing  certain  factors. 


Materials  and  methods 

Tissues 

We  used  an  in  vivo  mouse  model  of  mammary  development 
and  tumorigenesis  to  study  normal  tissue,  pregnant  and 
lactating  glands,  preneoplastic  lesions,  eight  primary  adeno¬ 
carcinomas  and  organ  metastases  to  the  liver  and  lung.  The 
TM  preneoplastic  outgrowth  lines  developed  after  transplan¬ 
tation  of  established  mouse  mammary  epithelial  cell  lines 
(MMEL)  into  cleared  mammary  fat  pads  of  3-week-old 
syngenic  BALB/cMed  mice  (Kittrell  et  al .,  1992).  The 
serially-transplanted  outgrowths  were  removed  either  as 
preneoplasias  at  8-12  weeks  after  transplantation  or  as 
tumors  5-7  months  after  transplantation.  After  removal,  the 
preneoplastic  outgrowths,  primary  adenocarcinomas,  and 
metastases  were  frozen  at  —  80°C  for  further  analysis.  The 
biological  properties  of  the  outgrowth  lines  are  shown  in 
Table  1. 

Epithelial  cell  isolation  from  mature  virgin  mammary  tissue 

Isolated  mammary  fat  pads  were  minced  and  incubated  in 
DM  EM  medium  with  2  mg/ml  collagenase  A  (Boehringer 
Mannheim,  Germany)  and  100  U/ml  hyaluronidase  (Boeh¬ 
ringer  Mannheim,  Germany)  at  37°C  for  3  h  and  slowly 
swirled.  Afterwards  the  solution  was  centrifuged  for  5  min  at 
1000  r.p.m.  The  supernatant,  containing  fat  and  single  cells, 
was  discarded  and  the  pellet,  containing  mammary  epithelial 
cells,  was  washed  in  PBS/5%  FCS  and  stored  at  —  80°C  for 
further  analysis. 

RNA  isolation,  RT-PCR  and  sequencing 

Tissues  were  minced  on  dry  ice  before  they  were  treated  with 
a  tissue  homogenizer  (Polytron,  Littau,  Switzerland)  in 
TRIzol  solution  (Gibco-BRL,  Gaithersburg,  MD,  USA) 
with  1  ml  per  100  mg  of  tissue  for  three  times  10  s.  After 
5  min  of  incubation  at  26°C,  0.2  ml  of  chloroform  per  1  ml 
of  TRIzol  were  added  and  the  solution  was  shaken 
vigorously,  followed  by  incubation  at  26°C  for  5  min.  After 
centrifugation  for  15  min  at  4°C  and  12  000  g  the  aqueous 
phase,  containing  the  total  RNA,  was  precipitated  with 
0.5  ml  of  isopropyl  alcohol  per  1  ml  of  TRIzol  for  10  min  at 
26°C  and  centrifuged  for  10  min  at  12  000  g  at  4°C.  The 
pellets  were  rinsed  with  75%  ethanol  (1  ml  of  ethanol  per 
1  ml  of  TRIzol  reagent  used  for  the  initial  homogenization), 
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vortexed  and  centrifuged  at  4°C  for  5  min  at  7500  g  and 
finally  dissolved  in  RNAse  free  water  and  stored  at  —  80°C 
for  further  analysis. 

Five  micrograms  of  RNA  were  used  for  cDNA  synthesis 
using  M-MuLV  reverse  transcriptase  (Perking  Elmer, 
Branchburg,  NJ,  USA)  and  oligo-dT  primers  followed  by 
PCR  using  primers  specific  for  different  SR  proteins: 


The  standard  form  of  CD44  mRNA  (CD44  std)  was 
detected  using  PCR  primers  for  constitutive  exons  5  and  16: 

5'-ACCCCAGAAGGCTACATTTTGC 
3'-CTCATAGGACCAGAAGTTGTGG 

Inclusion  of  individual  alternative  exons  was  monitored 
using  3'  primers  specific  for  individual  alternative  exons  and  a 
primer  complementary  to  constitutive  exon  5: 

CD44v5:  (MASV5)  3'-TTGTAGCATGTGGGGTCTCCTCTT, 
CD44v6:  (MASV6)  3'-CCTTCTGTCACATGGGAGTCTTCA, 
CD44v8:  (MASV8)  3'-CACTGAAAGTGGTC  CTGTCCTGTT . 

All  amplification  reactions  used  an  ATP-labeled  5'  primer. 
The  PCR  conditions  for  all  amplifications  were  as  follows: 
25-35  cycles  (see  Figure  legends)  of  94°C  for  1  min,  59°C  for 
1  min  and  72°C  for  1.5  min.  Gel  electrophoresis  of  35  pi  of 
the  100  pi  PCR  reaction  was  performed  in  6%  denaturing 
acrylamide  gels  at  34-40  mA.  Expected  amplification 
products  are  as  follows:  142  bp  (SRp20),  155  bp 

(hnRNP  Al  without  alternative  exon  7B),  134  and  170  bp 
(hnRNP  A2  with  or  without  the  alternative  exon),  and 
221  bp  (CD44  standard  lacking  any  alternative  exon).  The 
expected  amplification  bands  for  alternatively-spliced  CD44 
mRNA  containing  only  one  variable  exon  are  207  bp  (v5), 
209  bp  (v6)  and  190  bp  (v8).  pBR  yiljHpaW  markers  were 
used  in  all  displayed  gels  for  product  size  determination. 
Identified  amplification  products  resulting  from  the  inclusion 
of  one  or  two  variable  CD44  exons  were  sequenced  to  verify 
identity.  After  cloning  of  CD44  PCR  products  into  a  pCR 
2.1  vector  (TA  cloning  kit,  Invitrogen,  CA,  USA)  sequencing 
of  these  clones  was  performed  using  the  thermo  sequenase 
method  (Amersham,  OH,  USA)  and  CD44  exon  specific 
primers  mentioned  above. 


Total  cellular  protein  was  isolated  from  the  interphase  and 
phenol  phase  from  the  initial  homogenate  after  precipitation 
of  the  DNA  with  0.3  ml  100%  of  ethanol  per  1  ml  of 
TRIzol  used  for  the  initial  homogenization.  The  samples 
were  stored  at  26°C  for  5  min  and  afterwards  centrifuged  at 
2000  g  for  5  min  at  4°C.  The  phenol-ethanol  supernatant 
was  then  precipitated  with  1.5  ml  isopropyl  alcohol  per  1  ml 
TRIzol  used  for  initial  homogenization.  After  storage  for 
10  min  at  26°C  the  protein  precipitates  were  centrifuged  at 
12  000  g  at  4°C  for  10  min.  Protein  pellets  were  washed 
three  times  in  0.3  M  guanidine  hydrochloride  in  95%  ethanol 
for  20  min  at  26°C  and  centrifuged  at  7500  g  for  5  min  at 
4°C.  The  protein  pellets  were  vortexed  after  the  final  wash  in 
2  ml  100%  ethanol,  stored  for  20  min  at  26°C  and  finally 
centrifuged  at  7500  g  for  5  min  at  4°C.  The  pellets  were 
resuspended  in  1%  SDS  solution  and  incubated  at  50°C  for 
complete  dissolution.  Insoluble  material  was  removed  by 
centrifugation  at  10  000  g  for  10  min  at  4°C.  Supernatants 
were  stored  for  further  analysis  at  —  80°C.  Gel  electrophor¬ 
esis  of  20  pg  of  total  protein  was  performed  using  a  10% 


SRp20:  5'-GCCGTGTAAGAGTGG , 

3'-AAGCTTCCTCCTTCTTGG ; 
hnRNP  Al:  5'-GGTGGTCGTGGAGGTGGTT , 

3,-CCAAAATCATTGTAGCTTCC ,  and 
hnRNP  A2:  5'-ACAGTCTGTAAGCTTTCCCC, 
3'-CTGAAGCGACTGAGTCCGCG . 


CD44  exon  5: 
CD44  exon  16: 


Protein  isolation  and  Western  blot 
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SDS  PAGE  gel  at  100  volts.  Afterwards  the  gels  were 
electroblotted  on  a  PVDF  transfer  membrane  (PolyScreen, 
NEN  Life  Science,  Boston,  MA,  USA)  at  100  volts  for  2.5  h 
at  4°C.  After  blocking  in  5%  Blotto/PBST  membranes  were 
incubated  with  mouse  IgM  mAbl04  (1:5  dilution),  the 
SRp20  specific  mouse  IgG  Ab  (1 :200  dilution)  or  polyclonal 
rabbit  anti-U2AF  Ab  1 : 6000  dilution).  The  Western  blots 
were  stained  by  chemiluminescence  (NEN  Life  Science) 
using  appropriate  horseradish-peroxidase  labeled  anti  IgM 
antibody  for  mAbl04  (Pierce,  Rockford,  IL,  USA)  or  anti 


IgG  antibody  for  SRp20  Ab  and  U2AF  Ab  a  dilution  of 
1 : 3000. 
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